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1- Database principles: A mapping mechanism to support bitmap index and other 
auxiliary structures on tables stored as primary B Mrees 

Eugene Inseok Chong, Chuck Freiwald, Anh-Tuan Tran, Jagannathan Srinivasan, Aravind 
Yalamanchi, Ramkumar Krishnan, Souripriya Das, Mahesh Jagannath, Richard Jiang 
June 2003 ACM SIGMOD Record, volume 32 issue 2 

Full text available: ^ pdf( 198.55 KB) Additional Information: full citation , abstract , references 

Any auxiliary structure, such as a bitmap or a B + -tree index, that refers to rows of a table 
stored as a primary B + -tree (e.g., tables with clustered index in Microsoft SQL Server, or 
index-organized tables in Oracle) by their physical addresses would require updates due to 
inherent volatility of those addresses. To address this problem, we propose a mapping 
mechanism that 1) introduces a single mapping table, with each row holding one key value 
from the prima ... 



Functional-join processing 

R. Braumandl, J. Claussen, A. Kemper, D. Kossmann 

February 2000 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 8 Issue 3-4 
Full text available: ^ pdf(486.22 KB) Additional Information: full citation , abstract , index terms 

Inter-object references are one of the key concepts of object-relational and object-oriented 
database systems. In this work, we investigate alternative techniques to implement inter- 
object references and make the best use of them in query processing, i.e., in evaluating 
functional joins. We will give a comprehensive overview and performance evaluation of all 
known techniques for simple (single-valued) as well as multi-valued functional joins. 
Furthermore, we will describe special order-preser ... 

Keywords: Functional join, Logical OID, Object identifier, Order-preserving join, Physical 
01 D, Pointer join, Query processing 



3 Automatic generation of cells for recurrence structures 
Avinoam Bilgory, Daniel D. Gajski 

June 1981 Proceedings of the eighteenth design automation conference on Design 
automation 

Full text available: l f|| pdf(478.55 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes a method for automatic translation of functional into structural 
descriptions for Boolean recurrence systems. The solution of a recurrence system is 



Page 2 of 6 

accomplished by a netwo^^iat requires at most four different tyf^^bf cells. Given any 
Boolean recurrence of any order, the cell generator module defines the Boolean equations of 
these cells. 

Keywords: Boolean-recurrence solvers, Gate compilers, Logic-design automation 



4 Industry session 3: database performance and interface: A ma p ping mechanism to 
sup port bitmap index and other auxiliary structures on tables stored as primary En- 
trees 

Eugene Inseok Chong, Jagannathan Srinivasan, Souripriya Das, Chuck Freiwald, Aravind 
Yalamanchi, Mahesh Jagannath, Anh-Tuan Tran, Ramkumar Krishnan, Richard Jiang 
November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Full text available: ^pdf (63.19 KB) Additional Information: full citation , abstract, references , index terms 

Any auxiliary structure, such as a bitmap or a B + -tree index, that refers to rows of a table 
stored as a primary B + -tree (e.g., tables with clustered Index in Microsoft SQL Server, or 
index-organized tables in Oracle) by their physical addresses would require updates due to 
inherent volatility of those addresses. To address this problem, we propose a mapping 
mechanism that 1) introduces a single mapping table, with each row holding one key value 
from th ... 

Keywords: bitmap indexes, mapping mechanism, primary B + -trees 



7 



File organizations and access methods for CLV disks 
S. Christodoulakis, D. A. Ford 

May 1989 ACM SIGIR Forum , Proceedings of the 12th annual international ACM 

SIGIR conference on Research and development in information retrieval, 

Volume 23 Issue 1-2 

p ii ■ i -I ui « no Knn\ Additional Information: full citation , abstract , references, citings, index 

Full text available: TOpdf(1.Q8 MB) 

^ terms 

A large and important class of optical disc technology are CLV format discs such as CD ROM 
and WORM. In this paper, we examine the issues related to the implementation and 
performance of several different file organizations on CLV format optical discs such as CD 
ROM and WORM. The organizations examined are based on hashing and trees. The CLV 
recording scheme is shown to be a good environment for efficiently implementing hashing. 
Single seek access and storage utilization levels a ... 

O ptimizing multidimensional index trees for main memory access 
Kihong Kim, Sang K. Cha, Keunjoo Kwon 

May 2001 ACM SIGMOD Record , Proceedings of the 2001 ACM SIGMOD international 

conference on Management of data, volume 30 issue 2 

p ii , , -i ui « jx/n^-rn/m Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf(243,75 KB) *~ 

terms 

Recent studies have shown that cache-conscious indexes such as the CSB+-tree outperform 
conventional main memory indexes such as the T-tree. The key idea of these cache- 
conscious indexes is to eliminate most of child pointers from a node to increase the fanout 
of the tree. When the node size is chosen in the order of the cache block size, this pointer 
elimination effectively reduces the tree height, and thus improves the cache behavior of the 
index. However, the pointer elimination cannot be ... 

Query evaluation techniques for large databases 
Goetz Graefe 
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June 1993 ACM Computingj^rveys (CSUR), volume 25 issue 2 

i- Hi ^ i wi 0i jf/n o7 Additional Information: full citation , abstract , references , citings , index 

Full text available: TO pdf(9.37 MB) *~ 

L3 "^ terms , review 

Database management systems will continue to manage large data volumes. Thus, efficient 
algorithms for accessing and manipulating large sets and sequences will be required to 
provide acceptable performance. The advent of object-oriented and extensible database 
systems will not solve this problem. On the contrary, modern data models exacerbate the 
problem: In order to manipulate large sets of complex objects as efficiently as today's 
database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible 
database systems, iterators, object-oriented database systems, operator model of 
parallelization, parallel algorithms, relational database systems, set-matching algorithms, 
sort-hash duality 



8 Research sessions: path indexin g : Accelerating XPath location ste ps 
Torsten Grust 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 
Management of data 

Full text available: ^ pdf(1.12 MB) Additional Information: full citation , abstract , references 

This work is a proposal for a database index structure that has been specifically designed to 
support the evaluation of XPath queries. As such, the index is capable to support all XPath 
axes (including ancestor, following, preceding-sibling, descendant-or-self, etc). This feature 
lets the index stand out among related work on XML indexing structures which had a focus 
on regular path expressions (which correspond to the XPath axes children and descendant- 
or-self plus name tests). I ... 

9 S pecial system-oriented section: the best of SIGMOD '94: QuickStore: a hig h 
performance mapped object store 

Seth J. White, David J. DeWitt 

October 1995 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 4 Issue 4 

Full text available: ^ pdf(2.58 MB ) Additional Information: full citation , abstract , references 

QuickStore is a memory-mapped storage system for persistent C++, built on top of the 
EXODUS Storage Manager. QuickStore provides fast access to in-memory objects by 
allowing application programs to access objects via normal virtual memory pointers. This 
article presents the results of a detailed performance study using the 007 benchmark. The 
study compares the performance of QuickStore with the latest implementation of the E 
programming language. The QuickStore and E systems exemplify the two ba ... 

Keywords: benchmark, client-server, memory-mapped, object-oriented, performance, 
pointer swizzling 



10 Design of the Mneme persistent object store 
J. Eliot B. Moss 

April 1990 ACM Transactions on Information Systems (TOIS), volume 8 issue 2 

Full text available* pdf (3.22 MB) Additional Information: full citation , abstract , references , citings , index 
• Led ^ terms , review 

The Mneme project is an investigation of techniques for integrating programming language 
and database features to provide better support for cooperative, information-intensive tasks 
such as computer-aided software engineering. The project strategy is to implement efficient, 
distributed, persistent programming languages. We report here on the Mneme persistent 
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object store, a fundamen^^fcomponent of the project, discussing ^^esign and initial 
prototype. Mneme stores objects 

11 Performance comparison of property map and bitmap indexing Q 
Ashima Gupta, Karen C. Davis, Jennifer Grommon-Litton 

November 2002 Proceedings of the fifth ACM international workshop on Data 
Warehousing and OLAP 

Full text available: ^ pdf(250.60 KB) Additional Information: full citation , abstract , references , index terms 

A data warehouse is a collection of data from different sources that supports analytical 
querying. A Bitmap Index (BI) allows fast access to individual attribute values that are 
needed to answer a query by representing the values of an attribute for all tuples 
separately, as bit strings. A Property Map (PMap) is a multidimensional indexing technique 
that pre-computes attribute expressions, called properties, for each tuple and stores the 
results as bit strings [DD97, LD02]. This paper compares t ... 

Keywords: bitmap index, data warehouse, performance study 



1 2 Scalable h ig h-speed prefix matchin g Q 
Marcel Waldvogel, George Varghese, Jon Turner, Bernhard Plattner 
November 2001 ACM Transactions on Computer Systems (TOCS), volume 19 issue 4 

Full text available: * Q pdf(933.02 KB) Additional Information: full citation , abstract , references , index terms 

Finding the longest matching prefix from a database of keywords is an old problem with a 
number of applications, ranging from dictionary searches to advanced memory management 
to computational geometry. But perhaps today's most frequent best matching prefix lookups 
occur in the Internet, when forwarding packets from router to router. Internet traffic volume 
and link speeds are rapidly increasing; at the same time, a growing user population is 
increasing the size of routing tables against which p ... 

Keywords: collision resolution, forwarding lookups, high-speed networking 



13 Data structures for efficient broker implementation | 
Anthony Tomasic, Luis Gravano, Calvin Lue, Peter Schwarz, Laura Haas 
July 1997 ACM Transactions on Information Systems (TOIS), volume 15 issue 3 

i- ii. ^ i u. 0 xc ./m Additional Information: full citation , abstract , references , citings , index 

Full text available: Tu pdf(316.45 KB) -*-* 

^ terms , review 

With the profusion of text databases on the Internet, it is becoming increasingly hard to find 
the most useful databases for a given query. To attack this problem, several existing and 
proposed systems employ brokers to direct user queries, using a local database of summary 
information about the available databases. This summary information must effectively 
distinguish relevant databases and must be compact while allowing efficient access. We offer 
evidence that one broker, GIOSS 

Keywords: GIOSS, broker architecture, broker performance, distributed information, grid 
files, partitioned hashing 



14 Building a scaleable geo-spatial DBMS: technology, implementation, and evaluation 
Jignesh Patel, JieBing Yu, Navin Kabra, Kristin Tufte, Biswadeep Nag, Josef Burger, Nancy Hall, 
Karthikeyan Ramasamy, Roger Lueder, Curt Ellmann, Jim Kupsch, Shelly Guo, Johan Larson, 
David De Witt, Jeffrey Naughton 

June 1997 ACM SIGMOD Record , Proceedings of the 1997 ACM SIGMOD international 

conference on Management of data, volume 26 issue 2 
Full text available* -Q pdfd.58 MB) Additional Information: full citation , abstract , references , citin gs, index 
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terms 

This paper presents a number of new techniques for parallelizing geo-spatial database 
systems and discusses their implementation in the Paradise object-relational database 
system. The effectiveness of these techniques is demonstrated using a variety of complex 
geo-spatial queries over a 120 GB global geo-spatial data set. 

15 Goal-oriented buffer management revisited 
Kurt P. Brown, Michael J. Carey, Miron Livny 

June 1996 ACM SIGMOD Record , Proceedings of the 1996 ACM SIGMOD international 

conference on Management of data, volume 25 issue 2 
Full text available: ^ pdf(1.56 MB) Additional Information: full citation , abstract , references , index terms 

In this paper we revisit the problem of achieving multi-class workload response time goals 
by automatically adjusting the buffer memory allocations of each workload class. We discuss 
the virtues and limitations of previous work with respect to a set of criteria we lay out for 
judging the success of any goal-oriented resource allocation algorithm. We then introduce 
the concept of hit rate concavity and develop a new goal-oriented buffer allocation 
algorithm, called Class Fencing, th ... 

16 Extensions to Starburst: objects, types, functions, and rules 
Guy M. Lohman, Bruce Lindsay, Hamid Pirahesh, K. Bernhard Schiefer 
October 1991 Communications of the ACM, volume 34 issue 10 

Full text available: ^ pdf(5.21 MB) Additional Information: full citation , references , citings , index terms 



Keywords: Extended relational database management systems, Starburst, extensible 
database management systems 



17 B-trees: bearing fruits of all kinds Q 
Beng Chin Ooi, Kian-Lee Tan 

January 2002 Australian Computer Science Communications , Proceedings of the 

thirteenth Australasian conference on Database technologies - Volume 5, 

Volume 24 Issue 2 

Full text available: Q pdf (872.95 KB) Additional Information: full citation , abstract, references , index terms 

Index structures are often used to support search operations in large databases. Many 
advanced database application domains such as spatial databases, multimedia databases, 
temporal databases, and object-oriented databases, call for index structures that are 
specially designed and tailored for the domains. Interestingly, in each of these domains, we 
find methods that are based on one distinct structure — the B-tree. Invented some thirty 
years ago, the B-tree has been challenged repeatedly, but ... 

Keywords: b-tree, high-dimensional databases, main memory databases, multimedia 
databases, spatial databases 



18 An asymptotically o ptimal multiversion B-tree 

Bruno Becker, Stephan Gschwind, Thomas Ohler, Bernhard Seeger, Peter Widmayer 
December 1996 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 5 Issue 4 
Full text available: Q pdf(151.97 KB) Additional Information: full citation , abstract 

In a variety of applications, we need to keep track of the development of a data set over 
time. For maintaining and querying these multiversion data efficiently, external storage 
structures are an absolute necessity. We propose a multiversion B-tree that supports 



http://portal.acm.org/res^^ 



Page 6 of 6 

insertions and deletions d^^ta items at the current version and r^^b queries and exact 
match queries for any version, current or past. Our multiversion B-tree is asymptotically 
optimal in the sense that the time and space bounds are ... 

Keywords: Access methods, Information systems, Physical design, Versioned data 



19 Utilization of B-trees with inserts, deletes and modifies 
T. Johnson, D. Shasha 

March 1989 Proceedings of the eighth ACM SIGACT-SIGMOD-SIGART symposium on 
Principles of database systems 

Additional Information: full citation , abstract , references , citings , index 



Full text available: ' r 

^ terms 

The utilization of B-tree nodes determines the number of levels in the B-tree and hence its 
performance. Until now, the only analytical aid to the determination of a B-tree's utilization 
has been the analysis by Yao and related work. Yao showed that the utilization of B-tree 
nodes under pure inserts was 69%. We derive analytically and verify by simulation the 
utilization of B-tree nodes constructed from N inserts followed by M modifies (where M 

20 B-trees: Compact B-trees 

Arnold L. Rosenberg, Lawrence Snyder 

May 1979 Proceedings of the 1979 ACM SIGMOD international conference on 
Management of data 

Full text available: 1 ^ pdf(928.31 KB) Additional Information: full citation , abstract, references 

A B-tree is compact if it is minimal in number of nodes, hence has optimal space utilization, 
among equally capacious B-trees of the same order. The space utilization of compact B- 
trees is analyzed and is compared with that of noncompact B-trees and of (node)-visit- 
optimal B-trees, which minimize the expected number of nodes visited per key access. 
Compact B-trees can be as much as a factor of 2.5 more space-efficient than visit-optimal 
B-trees; and the node-visit cost of a compact ... 
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Pond: the OceanStore Prototype - Rhea. Eaton, Geels, Weatherspoon.. (2003) (Correct) (3 citations) 
by an active GUID, or AGUID. Each version is a B-tree> of read-only blocks child pointers are secure 
object is stored in a data structure similar to a B-tree>, in which a block references each child by a 
oceanstore.cs.berkeley.edu/publications/papers/compressed/fast2003-pond.ps.gz 

Information Fusion in Biometrics - Ross. Jain. Qian (2001) ( Correc t) (3 citations) 

4 11,021 Impostor 72 10,953 (a) C5.0 Decision Tree, b) Linear Discriminant classifier. Table 1. 

the performance of the (a) C5.0 Decision Tree, and (b) Linear Discriminant classifier, on an 

biometrics.cse.msu.edu/RossFusion_AVBPA01.pdf 

The Alternating Decision Tree Learning Algorithm - Freund. Mason (1999) ( Correct ) (8 citations) 

semantics for representing b) c) Figure 1: Tree-based classifiers: c) a general alternating tree. 

(c) a general alternating tree, a) a decision tree, b) the same decision tree represented as an 

is the prediction associated with the root of the tree, b) A base rule r with precondition d can be in the 

www.lsmason.com/papers/ICML99-AlternatingTrees.pdf 

Multiway Range Trees: Scalable IP Lookup with Fast Updates - Suri. Va r ghese. Warkhede (2001) 
(Correct) (2 citations) 

be generalized to a multiway search tree we use a B-tree>. In a B-tree>, each node other than the root has 
to a multiway search tree we use a B-tree>. In a B-tree>, each node other than the root has at least t 
www.cs.ucsd.edu/-varghese/PAPERS/globecom2001.pdf 

Implementing l/O-Efficient Data Structures Using TPIE - Arge. Procopiuc. Vitter (2002) (Correct) 
(1 citation) 

been implemented in the second phase, including B-trees [12] persistent B-trees [9] R -trees [10] 
second phase, including B-trees [12]persistent B-trees [9] R -trees [10]CRB-trees [1]K-D -B-trees 
a data structure using TPIE. We chose the K-D-B-tree> because it is a relatively simple yet typical 
www.cs.duke.edu/%7Elarge/Papers/tpieds.ps 

Scalable Integrated Region-based Image Retrieval using IRM and.. - Wang. Du (2001) (Correct) 
(1 citation) 

using various tree structures such as K-D-B-tree> [28]quadtree [9] R-tree [11] tree [31] 

V, pp. 161-64, 1993. 28] J. Robinson, The k-d-b-tree>: A search structure for large multidimensional 

www-db.stanford.edu/~wangz/project/imsearch/SIMPLIcity/DL2001/wang2.pdf 

Purely Functional Representations of Catenable Sorted Lists ■ Kaplan. Tarjan (1996) ( Corr e ct ) 
(7 citations) 

is an ordinary balanced search tree (like an a,b-tree> for example) in which each node along the left 
described could be modified to use any kind of a,b-trees. Particularly interesting is a relation between 
returns a tree of the same height as its input tree but it may return a tree in which the root has 
www.math.tau.ac.il/~haimk/papers/loglog23.ps 

Incremental Processing of Vague Queries in Interactive.. ■ Pfeifer, Pennekamp (1994) ( Correc t) 
(3 citations) 

for example be processed very efficiently using a B*tree> for the date attribute. For the string 

by the graph in figure 1. The DATE node uses a B*tree>, the AUTHOR node interfaces to a special string 

ls6-www.informatik.uni-dortmund.de/bib/fu!ltext/ir/./Pfeifer_Pennekamp:97.pdf 

Probabilistic Top-Down Parsing and Language Modeling -Roark ( Correc t) (3 citations) 
Figure 1 Three parse trees: a) a complete parse tree b) a complete parse tree with an explicit stop 
Geman (1998) proved that any PCFG estimated from a treebank with the relative frequency estimator is 
First, it is easily reversible, i.e. every parse tree built with Gf corresponds to a unique parse tree 
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Bitmap Indices for Speeding Up - High-Dimensional Data Analysis (Correct ) 

optimised for one-dimensional queries such as the Btree whereas others are optimised for 

S. Berchtold, C. Boehm, H.P. Kriegel, The Pyramid-Tree: Breaking the Curse of D imensionality, SIGMOD 

kurts.home.cern.ch/kurts/PHD/../RESEARCH/dexa2002_bitmaps.ps 

The Rectilinear Steiner Arborescence Problem is NP-Complete - Chen (Correct ) 

Arborescence (a)a Rectilinear Steiner Minimum Tree (b)and a Rectilinear Minimum Spanning Tree (c) 

ece.tamu.edu/~wshi/pub/soda.pdf 

Slicing Floorplan with Clustering Constraint • Yuen And Evangeline (Correct) 
[5]boundsliceline -grid(BSG) 6]0-tree [2]B tree> [1] and TCG [4]have been proposed for 
Y. C. Chang, Y. W. Chang, G. M. Wu, and S. W. Wu. B*trees: A new representation for non-slicing 
www.cse.cuhk.edu. hkMyyoung/paper/tcadS.ps 

Versioned Backups and Index Concurrency - Results Of Work-in-Progress (Correct) 

of a full temporal index, the Time-Split B-tree> [LoSal, LoSa2] or TSB-tree with that of a fuzzy 

new method will undoubtedly be in its use in B trees, it can be extended to other structures. In 

nodes are split bottom up like those of a B tree>, but can be split either by time or by key. When 

ftp.research.microsoft.com/users/lomet/pub/newhpts.ps 

Dynamically Partitioned Test Scheduling for SoCs Under Power.. - Zhao. Upadhyaya (Correct ) 
partitioning a test session T2 T1 Time Power (b) tree> growing technique a test session T1 T2 T3 Time 
www.cse.buffalo.edu/~shambhu/resume/natw02.pdf 

Spatial Lesion Indexing for Medical Image Databases Using.. - Histograms Chi-Ren Shyu ( Correct) 
for objects in two to three dimensional spaces: B-trees [5]R-trees [8]and their variants [2, 20, 21, 
gray-scale, etc. These tree structures include K-D-B tree> [18]Metric tree [25] and Multihash indexing 
www.cecs.missouri.edu/~matsakis/Publications/CVPR01.pdf 

The BUB-Tree - Robert Fenk Fenk ( Correct ) 

separators of Z-regions in the index part of the B-Tree> it stores Z-intervals bounding the data stored on 
data, which inherits all good properties of the B-Tree> [BM72]Logarithmic performance guarantees are 
problem of the UB-Tree, we propose the bounding UB-Tree (BUB-Tree)an UBTree storing additional 
mistral.in.tum.de/results/publications/Fen02b.pdf 

Mixed-Integer Nonlinear Programming - Armin ( Corre ct) 

or RMINLP)Moreover, each node of the emerging B&B tree> represents a solution of the RMINLP with 
adjusted 

www.gamsworld.org/minlp/siagopt.pdf 

A Note on Parallel Algorithms for Optimal h-v Drawings of - Binary Trees Panagiotis (Correct) 
i shows three different h-v drawings of the same tree, b e c g c) a) b) d) Figure 1: A binary tree 
www.wellesley.edu/CS/pmetaxas/CGTA98.pdf 

Compressing Bitmap Indexes for Faster Search - Operations Kesheng Wu (Correct) 

schemes [6]21]30]such as the variants of B-tree> [9] or R-tree [1 1]According to the performance 

for most database systems. For example, if a B-tree> index is created for each attribute, ORACLE 

applications, bitmap indexes are better than the tree based schemes [6]21]30]such as the variants 

crd.lbl.gov/-kewu/ps/LBNL-49627.pdf 

European Organization For Nuclear Research Cern/lhcc 97-9.. - April Using An (Correct) 
18.1 B-Tree> 

of indices on selective analysis attributes. 18.1 B-Tree> Indices Indices are a way to cluster one or more 
number of objects in the tree, this means that a tree based search will scale much better than a 
wwwinfo.cern.ch/asd/rd45/reports/m3_96/milestone_3.ps 
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